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Abstract 

The Golden Code is a full-rate full-diversity space-time code, which achieves maximum coding gain for 
Multiple-Input Multiple-Output (MIMO) systems with two transmit and two receive antennas. Since four information 
symbols taken from an A/-QAM constellation are selected to construct one Golden Code codeword, a maximum 
likelihood decoder using sphere decoding has the worst-case complexity of 0(M 4 ), when the Channel State 
Information (CSI) is available at the receiver. Previously, this worst-case complexity was reduced to 0(M 2 5 ) 
without performance degradation. When the CSI is known by the transmitter as well as the receiver, beamforming 
techniques that employ singular value decomposition are commonly used in MIMO systems. In the absence of 
channel coding, when a single symbol is transmitted, these systems achieve the full diversity order provided by the 
channel. Whereas this property is lost when multiple symbols are simultaneously transmitted. However, uncoded 
multiple beamforming can achieve the full diversity order by adding a properly designed constellation precoder. 
For 2x2 Fully Precoded Multiple Beamforming (FPMB), the general worst-case decoding complexity is O(M). In 
this paper, Golden Coded Multiple Beamforming (GCMB) is proposed, which transmits the Golden Code through 
2x2 multiple beamforming. GCMB achieves the full diversity order and its performance is similar to general 
MIMO systems using the Golden Code and FPMB, whereas the worst-case decoding complexity of 0(y/M) is 
much lower. The extension of GCMB to larger dimensions is also discussed. 
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I. Introduction 

The Golden Code is a space-time code for Multiple-Input Multiple-Output (MIMO) systems with two 
transmit and two receive antennas HI, 0. It achieves both the full rate and the full diversity. Its Bit Error 
Rate (BER) performance is so far the best compared to other 2x2 space-time codes. Moreover, it has 
a nonvanishing coding gain that is independent of the size of the signal constellation, thus it achieves 
the optimal diversity-multiplexing performance presented in (3). Because of those advantages, the Golden 
Code has been incorporated into the 802. 16e WiMAX standard JH. 

Since each Golden Code codeword employs four information symbols from an M-QAM constellation, 
M 4 points are calculated by exhaustive search to achieve the Maximum Likelihood (ML) decoding. Hence, 
the decoding complexity is proportional to M 4 , denoted by C(M 4 ). Sphere Decoding (SD) is an alternative 
for ML with reduced complexity [5|. While SD reduces the average decoding complexity, the worst-case 
complexity is still C(M 4 ). 

To reduce the decoding complexity of the Golden Code, several techniques have been proposed. In 
1161 , 0, the worst-case complexity of the Golden Code is reduced to 0(M 2 - 5 ) without performance 
degradation. In [HI, an improved sphere decoding for the Golden Code is designed to reduce the average 
decoding complexity. In [9|, a decoding technique with the complexity of 0(M 2 ) is presented, which is 
based on the Diophantine approximation and with the trade-off of 2dB performance loss. Other suboptimal 
decoders for the Golden Code are discussed in ifTDl and [fTTTl . 

When channel state information (CSI) is available at the transmitter, beamforming techniques, which 
exploit Singular Value Decomposition (SVD), are applied in a MIMO system to achieve spatial multiplex- 
ing and thereby increase the data rate, or to enhance the performance lfl2l . However, spatial multiplexing 
without channel coding results in the loss of the full diversity order [13]. To overcome the diversity 
degradation of multiple beamforming, the constellation precoding technique can be employed Ifl4l . It is 
shown in [fl4l that Fully Precoded Multiple Beamforming (FPMB) achieves full diversity. 

In this paper, the technique of Golden Coded Multiple Beamforming (GCMB) that combines the Golden 
Code with 2x2 multiple beamforming is proposed. GCMB achieves both the full rate and the full 
diversity similar to the general MIMO systems employing the Golden Code and 2x2 FPMB. All these 
three techniques have almost the same BER performance. However, the worst-case decoding complexity 
of GCMB is reduced to 0(VM) compared to general MIMO systems using the Golden Code. This 
complexity is lower than FPMB as well, whose worst-case complexity is 0(M). 
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The remainder of this paper is organized as follows: In Section HH the description of GCMB is given. 
In Section Unl the diversity analysis of GCMB is provided. In Section [IVj the decoding technique and 
complexity of GCMB are shown. In Section |Vl the extension of GCMB to larger dimensions is discussed. 
In Section [VO performance comparisons of different techniques are carried out. Finally, a conclusion is 
provided in Section IVIIl 

II. GCMB Overview 

Fig. Q] represents the structure of GCMB. Firstly, the information bit sequence is modulated by the M- 
ary square QAM. Then four consecutive modulated complex- valued scalar symbols si, s 2 , S3, and S4 are 
encoded into the Golden Code codewords. The codewords X of the Golden Code are 2 x 2 complex- valued 
matrices 0], given as 

(1 + i0)s\ + (a - i)s 2 (1 + i/3)ss + (a - i)s A 

1 0) 

(i - a)s 3 + (1 + i/3)s4 (1 + ia)si + ((5 - i)s 2 

with a = ^fi and (5 = (^fi). 

The MIMO channel H 6 (£N r xN t - ls assume d to be quasi-static, Rayleigh, and flat fading, and known 
by both the transmitter and the receiver, where N r = N t = 2 denote the number of transmit and 
receive antennas respectively, and C stands for the set of complex numbers. The beamforming vectors are 
determined by the SVD of the MIMO channel, i.e., H = UAV^ where U and V are unitary matrices, 
and A is a diagonal matrix whose s th diagonal element, \ s E M + , is a singular value of H in decreasing 
order, where M + denotes the set of positive real numbers. When S streams are transmitted at the same 
time, the first S vectors of U and V are chosen to be used as beamforming matrices at the receiver and 
the transmitter, respectively. In the case of GCMB, the number of streams S = 2. 

The received signal is 

Y = AX + N, (2) 

where Y is a 2 x 2 complex- valued matrix, and N is the 2x2 complex- valued additive white Gaussian 
noise matrix whose elements have zero mean and variance N = S/SNR. The channel matrix H is 
complex Gaussian with zero mean and unit variance. The total transmitted power is scaled as S in order 
to make the received Signal-to-Noise Ratio (SNR) SNR. 



Let x denote the signal set of M-QAM. Then the ML decoding of © is obtained by 



X = min||Y- AX| 



(3) 



where j G {!,••• , 4}. 



III. Diversity Analysis 

In this section, diversity analysis of GCMB is carried out by calculating the Pairwise Error Proba- 
bility (PEP) between the transmitted codeword X and the detected codeword X. For ML decoding, the 
instantaneous PEP is represented as 



Pr ( X -> X I H ) = Pr (j|Y - AX|| 2 > ||Y - AX|| 2 I H 
= Pr U > ||A(X-X)|| 2 | H 



(4) 



where £ = Tr{ — (X — ~X.) H A H ~N — N^A(X — X)}. Since £ is a zero mean Gaussian random variable 
with variance 2A ||A(X — X)|| 2 , © is given by the Q function as 



Pr X ->■ X | H = Q 



|AfX-X)|| 2 



2N 



(5) 



By using the upper bound on the Q function Q{x) < \e x2 ^ 2 , the average PEP can be upper bounded as 



Pr (X -> x) = E Pr [ 

< E 



Pr X ->■ X I H 



2 GXP 



|A(X-X) 

4A n 



(6) 



Define xi = [si, S2] T and x 2 = [S3, s 4 ] T . Then the Golden Code codeword can be represented as HI 

X = diag(G Xl ) + diag(Gx 2 )E, (7) 

where 



G 



V5 



a — i 




1 




,E = 




0-i 




i 



and diagff 1, ■ • • ,v s ] denotes a diagonal matrix with diagonal entries v 1, • • • ,v S - Let gj with j — 1, • • • , S 
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denote the j th column of G T , then gj denotes the j th row of G. Equation (QQ) is then rewritten as 



X 



Therefore, 



AX 



T T 

Si x i Si x 2 
isl*2 g^x x 



Aigfxi Aigfx 2 
iA 2 gf , x 2 A 2 g 2 r xi 



(8) 



(9) 



Then, 



|AX|| 2 = Tr{X H A^AX} 



A?(|gf Xl | 2 + |g{x 2 | 2 ) + A'dg^xxl 2 + (g^l 



(10) 



Let xi = [s%, s 2 ] and x 2 = [s 3 , s 4 ] denote the detected symbol vectors. By replacing xi and x 2 in (flOl) by 
xi — xi and x 2 — x 2 , © is then rewritten as 



Pr (X ->• x) < E 



I / Pi A 2 + p 2 A^ 



2 6XP 



4iV n 



(11) 



where 



Pi 



?2 T (X 2 -X 2 )| 2 ) 



P2 = (|g 2 (xi -x x ; 

The upper bound in (TTTI) can be further bounded by employing a theorem from lfi~5l which is specified 
below. 

Theorem 1: Consider the largest S < min(A^, N r ) eigenvalues p s of the uncorrelated central N r x N t 
Wishart matrix that are sorted in decreasing order, and a weight vector p = [pi, • • • , ps] T with non-negative 
real elements. In the high SNR regime, an upper bound for the expression E[exp(— 7 ^f =1 p s p s )], which 
is used in the diversity analysis of a number of MEMO systems, is 

s 



E 



exp -7 psPs 



where 7 is signal-to-noise ratio, C, is a constant, p 
non-zero element in the weight vector. 



■nin = min{pi, • • ■ , ps}, and 5 is the index to the first 



Proof: See (T5). I 
Note that p 1 > and p 2 > 0, then 5 = 1. By applying Theorem Q] to (fTTT) . an upper bound of PEP is 



Pr (X -> Xj < C 
Hence, GCMB achieves the full diversity order of 4. 



minWa) ^ 
4iV 



-JVriVt 



(12) 



IV. Decoding 

Equation © shows that each element of AX is only related to xj or x 2 . Consequently, the elements of 



AX can be divided into S groups, where the j group contains elements related to Xj and j 
The input-output relation in © then is decomposed into two equations as 



S. 



yi = 


^2,2 




Aigfxi 
_ A 2 g 2 r xi _ 


+ 


N 1A 
. N2 > 2 . 


Y2 = 


*1,2 
*2,1 




Aigfx 2 
iA 2 g 2 r x 2 


+ 


N 2A 



(13) 



where Yj k and Nj <k denote the (j, k) th element of Y and N respectively. Let ii! 
n 2 = [Nip, N 2 a] t , then (fT3l) can be further rewritten as 

yi = AGxx + ri!, 
y 2 = $AGx 2 + n 2 , 

where 



[^1,1, ^2, 2 ] T and 



(14) 



1 

i 



The input-output relation of (TT4l) implies that xi and x 2 can be decoded separately. Indeed, each 
relation of (fT4l) has a form similar to FPMB presented in lfl4l . In lfT6l . ifTTl . a reduced complexity SD 
is introduced. The technique takes advantage of a special real lattice representation, which introduces 
orthogonality between the real and imaginary parts of each symbol, thus enables employing rounding (or 
quantization) for the last two layers of the SD. When the dimension is 2 x 2, it achieves ML performance 
with the worst-case decoding complexity of O(M). This technique can be employed to decode 2x2 
FPMB or GCMB. Moreover, lower decoding complexity can be achieved for GCMB because of the 



special property of the G matrix. 

By using the QR decomposition of AG = QR, where R is an upper triangular matrix, and the matrix 
Q is unitary, (fl4l) is rewritten as 



Yi = Q^Yi = Rxi + Q/V = Rx x + n 1; 

y 2 = Q H ^ H y 2 = Rx 2 + Q H $ H n 2 = Rx 2 + n 2 . 



(15) 



Let fj denote the j column of 

Ai(l + z/3) Ai(a-i) 
Aa(H-ia) A 2 (/?-z) 

where j = 1, • • • , S. The elements of R are calculated as 



AG 



(16) 



Ri,i — ||fi||, 

< f 2 , fi > 



|fil 



(a-fl(A?-Aj) 
llfill 



(17) 



Rl,2 

R2,i = o, 

R2,2 = ||d 2 ||, 

where d 2 = f 2 — ( n^jjlp^p )fi. < f2,fi >= f^fi* and Rj^ denotes the {j,k) th element of R. Based on 
(fTTT) . the R matrix is proved to be real-valued, which means the real and imaginary parts of (fT5l) can be 
decoded separately. Consequently, (fT5l) can be decomposed further as 



(18) 



K{y 1 } = RK{x 1 } + K{fi 1 }, 
S{y 1 } = R3{x 1 } + S{fi 1 }, 
K{y 2 } = RK{x 2 } + 3ft{fi 2 }, 
3{y 2 } = R3{x 2 } + 3{n 2 }, 

where 9ft{v} and 3{v} denote the real part of imaginary part of v respectively. 

To decode each part of (TTSl , a two-level real-valued SD can be employed plus applying the rounding 
procedure for the last layer. As a result, the worst-case decoding complexity of GCMB is 0{yfM). 

Previously, the ML decoding of GC was shown to have the worst-case complexity of C(M 2 5 ) J6]|, 0. 
However, the above analysis proves that this complexity can be reduced substantially to only 0(y/M) by 
applying GCMB when CSI is known at the transmitter. Furthermore, the complexity of GCMB is lower 
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than the full-diversity full-multiplexing FPMB as well. The worst-case decoding complexity of 2 x 2 
FPMB is 0(M) with the decoding technique presented in QU, IfTTll . 

V. Extension to larger dimensions 

In [fT8l . the Golden Code is generalized to Perfect Space-Time Block Code (PSTBC) in dimensions 2, 
3, 4 and 6, which have the full rate, the full diversity, nonvanishing minimum determinant for increasing 
spectral efficiency, uniform average transmitted energy per antenna, and good shaping of the constellation. 
In |fT9l , PSTBCs have been generalized to any dimension. However, it is proved in 11201 that particular 
perfect codes, yielding increased coding gain, only exist in dimensions 2, 3, 4 and 6. In this section, 
GCMB is generalized to larger dimensions of 3, 4, and 6 by transmitting the corresponding PSTBC 
through multiple beamforming. 

The codewords of a PSTBC are constructed as 

s 

X = diag(Gx i )E^ 1 (19) 

3=1 

where S is the system dimension, G is a S x S unitary matrix, Xj is a S x 1 vector whose elements are 
information symbols, and 

1 ••• 
1 ... : 

e= ; = ; , 

o 1 

g ••■ 

with 



i, S = 


= 2,4, 


e 3 , 6 


= 3, 


2-7TZ _ 

— e 3 & 


= 6. 



The selection of the G matrix for different dimensions can be found in lfi~8l . 

In the sequel, the technique which transmits PSTBC through multiple beamforming is called Perfect 
Coded Multiple Beamforming (PCMB). Similarly to GCMB, the received signal of PCMB can be repre- 
sented as ([21). PCMB achieves the full diversity order, which can be proved by analyzing the PEP in a 
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similar way to Section [Till 

Similarly to GCMB, the elements of AX for PCMB are related to only one of the Xy, thus can be 
divided into S groups. The received signal is then divided into S parts, which can be represented as 

y ; <l>,AGx, n ; . (20) 

where <frj = diag(0 j l , • • ■ , (j)j t g) is a diagonal unitary matrix whose elements satisfy 

f 1, l<k<S + l-j, 
<Pj k — \ 

{ g, S + 2-j <k<S. 

By using the QR decomposition of AG = QR, where R is an upper triangular matrix, and the matrix 
Q is unitary, and moving <frjQ to the left hand, (1201) is rewritten as 

% = Q H ®fy 3 = Rxj + Q H $f rij = R Xj + hj. (21) 

For the dimension of S = 4, the R matrix in (|2TT) is real-valued, which can be proved in a similar way 
to Section [IV] Consequently, the real part and the imaginary part of Xj can be decoded separately, in a 
similar way to GCMB. Real-valued SD with the last layer rounded can be employed to decode (|2TI) . The 
worst-case decoding complexity of PCMB is then 0(M 1 - 5 ). Regarding MIMO systems using PSTBC, 
the worst-case decoding complexity is C(M 13,5 ) by using a similar decoding technique to [0, 0. For 
FPMB, ML decoding can be achieved by using SD based on the real lattice representation in |fi~6l , [fTTl , 
plus quantization of the last two layers, and the worst-case complexity is 0(M 3 ). 

For the S = 3 dimension case, the R matrix is complex-valued. Therefore, the real and the imaginary 
parts of Xj cannot be decoded separately, unlike the case of S = 2,4. Moreover, since the M-HEX 
constellation [21] is used instead of M-QAM, a complex- valued SD is needed. The worst-case decoding 
complexity of PCMB is then C(M 3 ). In the case of general MIMO systems using PSTBC, the worst-case 
decoding complexity is 0(M 9 ). For FPMB, the worst-case decoding complexity is 0(M 2 ). 

In the case of S = 6, which is similar to S = 3, the R matrix is complex- valued, and M-HEX signals are 
transmitted. Consequently, the worst-case decoding complexity is 0(M 6 ), while the worst-case decoding 
complexity of general MIMO systems using PSTBC and FPMB are C(M 36 ) and 0(M 5 ) respectively. 
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VI. Simulation Results 

Considering 2x2 systems, Fig. [2] shows BER-SNR performance comparison of GCMB, FRMB, and 
general MIMO systems using the Golden Code, which is denoted by GC, for different modulation schemes. 
The constellation precoder for FPMB is selected as the best one introduced in Ifl4ll . Simulation results 
show that GCMB, GC, and FPMB, with the worst-case decoding complexity of 0(VM), C(M 2 5 ), and 
O(M), respectively, achieve very close performance for all of 4-QAM, 16-QAM, and 64-QAM. The 
performance differences among these three are less than ldB, and become smaller when the modulation 
alphabet size increases. 

In the case of 4 x 4 systems, Fig. [3] shows BER-SNR performance comparison of PCMB, FPMB, 
and general MIMO systems using the PSTBC, which is denoted by PC, for 4-QAM and 16-QAM. The 
constellation precoder for FPMB is also chosen as the best one in lfi~4l . Simulation results show that PCMB 
has approximately 3dB and ldB performance degradations compared to PC and FPMB, respectively, 
and the degradations decrease as the modulation alphabet size increases. However, the performance 
compromises of PCMB trade off with reductions of the worst-case decoding complexity for PC and 
FPMB from C(M 13 ' 5 ) and C(M 3 ) to only C(M 15 ), respectively. 

VII. Conclusion 

In this paper, GCMB which combines the Golden Code and multiple beamforming technique is pro- 
posed. It is shown that GCMB achieves full-diversity, full-rate, and low decoding complexity. Compared 
to general MIMO systems using the Golden Code, GCMB has similar performance while the worst-case 
decoding complexity is reduced from C(M 2 ' 5 ) to only 0(\/M), when square A/-QAM is used. The 
substantial complexity reduction benefits from the knowledge of CSI at the transmitter. Moreover, the 
complexity of GCMB is also lower than 2x2 FPMB, which is a full-diversity full-rate beamforming 
technique without channel coding, with the worst-case decoding complexity of O(M). Similarly, GCMB 
and FPMB have very close performance. 

GCMB is generalized to PCMB in dimensions 3, 4, and 6. PCMB combines PSTBC with multiple 
beamforming. Similarly to GCMB, PCMB reduces the worst-case decoding complexity of general MIMO 
systems using PSTBC from C(M 9 ), C(M 135 ), and C(M 36 ) to C(M 3 ), C(M L5 ), and C(M 6 ) in di- 
mensions 3, 4, and 6, respectively. Compared to FPMB, the worst-case decoding complexity of PCMB is 
lower than 0(M 3 ) of FPMB in dimension 4, while due to the complex- valued R matrix and the HEX 
signals, higher than 0(M 2 ) and 0(M 5 ) of FPMB in dimensions 3 and 6, respectively. 
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Fig. 1. Structure of GCMB. 
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Fig. 2. BER vs. SNR for GCMB, GC and FPMB for 2 x 2 systems. 
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Fig. 3. BER vs. SNR for PCMB, PC and FPMB for 4 x 4 systems. 



